NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

LLMs Can Reason Faster Only If We Let Them

Sel, Bilgehan; Huang, Lifu; Ramakrishnan, Naren; Jia, Ruoxi; Jin, Ming (June 2025, International Conference on Machine Learning (ICML))

Full Text Available
ProtAlign-ARG: antibiotic resistance gene characterization integrating protein language models and alignment-based scoring

https://doi.org/10.1038/s41598-025-14545-4

Ahmed, Shafayat; Emon, Muhit Islam; Moumi, Nazifa Ahmed; Huang, Lifu; Zhou, Dawei; Vikesland, Peter; Pruden, Amy; Zhang, Liqing (August 2025, Scientific Reports)

Full Text Available
Advancing Chart Question Answering with Robust Chart Component Recognition

https://doi.org/10.1109/WACV61041.2025.00560

Zheng, Hanwen; Wang, Sijia; Thomas, Chris; Huang, Lifu (February 2025, IEEE)

Chart comprehension presents significant challenges for machine learning models due to the diverse and intricate shapes of charts. Existing multimodal methods often over-look these visual features or fail to integrate them effectively for Chart Question Answering. To address this, we introduce CHARTFORMER, a unified framework that enhances chart component recognition by accurately identifying and classifying components such as bars, lines, pies, titles, legends, and axes. Additionally, we propose a novel Question-guided Deformable Co-Attention (QDCAt) mechanism, which fuses chart features encoded by Chart-former with the given question, leveraging the question's guidance to ground the correct answer. Extensive experiments demonstrate a 3.2% improvement in mAP over the baselines for chart component recognition. For ChartQA and OpenCQA tasks, our approach achieves improvements of 15.4% in accuracy and 0.8 in BLEU score, respectively, underscoring the robustness of our solution for detailed visual data interpretation across various applications.
more » « less
Full Text Available
Debate as Optimization: Adaptive Conformal Prediction and Diverse Retrieval for Event Extraction

https://doi.org/10.18653/v1/2024.findings-emnlp.958

Wang, Sijia; Huang, Lifu (November 2024, Proceedings of the conference Association for Computational Linguistics Meeting)
Al-Onaizan, Y; Bansal, M; Chen, Y (Ed.)
We propose a multi-agent debate as optimization (DAO) system for event extraction, where the primary objective is to iteratively refine the large language models (LLMs) outputs through debating without parameter tuning. In DAO, we introduce two novel modules: the Diverse-RAG (DRAG) module and the Adaptive Conformal Prediction (AdaCP) module. DRAGsystematically retrieves supporting information that best fits the debate discussion, while AdaCP enhances the accuracy and reliability of event extraction by effectively rejecting less promising answers. Experimental results demonstrate a significant reduction in the performance gap between supervised approaches and tuning-free LLM-based methods by 18.1% and 17.8% on ACE05 and 17.9% and 15.2% on CASIE for event detection and argument extraction respectively.
more » « less
Full Text Available
DiPT: Enhancing LLM Reasoning through Diversified Perspective-Taking

https://doi.org/10.18653/v1/2025.findings-naacl.356

Just, Hoang Anh; Dabas, Mahavir; Huang, Lifu; Jin, Ming; Jia, Ruoxi (January 2025, Association for Computational Linguistics)

Full Text Available
Targeted Augmentation for Low-Resource Event Extraction

https://doi.org/10.18653/v1/2024.findings-naacl.275

Wang, Sijia; Huang, Lifu (June 2024, Association for Computational Linguistics)

Addressing the challenge of low-resource information extraction remains an ongoing issue due to the inherent information scarcity within limited training examples. Existing data augmentation methods, considered potential solutions, struggle to strike a balance between weak augmentation (e.g., synonym augmentation) and drastic augmentation (e.g., conditional generation without proper guidance). This paper introduces a novel paradigm that employs targeted augmentation and back validation to produce augmented examples with enhanced diversity, polarity, accuracy, and coherence. Extensive experimental results demonstrate the effectiveness of the proposed paradigm. Furthermore, identified limitations are discussed, shedding light on areas for future improvement.
more » « less
Full Text Available
RE2: Region-Aware Relation Extraction from Visually Rich Documents

https://doi.org/10.18653/v1/2024.naacl-long.484

Ramu, Pritika; Wang, Sijia; Mouatadid, Lalla; Rimchala, Joy; Huang, Lifu (June 2024, Association for Computational Linguistics)

Current research in form understanding predominantly relies on large pre-trained language models, necessitating extensive data for pre-training. However, the importance of layout structure (i.e., the spatial relationship between the entity blocks in the visually rich document) to relation extraction has been overlooked. In this paper, we propose REgion-Aware Relation Extraction (\bf{RE^2}) that leverages region-level spatial structure among the entity blocks to improve their relation prediction. We design an edge-aware graph attention network to learn the interaction between entities while considering their spatial relationship defined by their region-level representations. We also introduce a constraint objective to regularize the model towards consistency with the inherent constraints of the relation extraction task. To support the research on relation extraction from visually rich documents and demonstrate the generalizability of \bf{RE^2}, we build a new benchmark dataset, DiverseForm, that covers a wide range of domains. Extensive experiments on DiverseForm and several public benchmark datasets demonstrate significant superiority and transferability of \bf{RE^2} across various domains and languages, with up to 18.88% absolute F-score gain over all high-performing baselines
more » « less
Full Text Available
Generating A Crowdsourced Conversation Dataset to Combat Cybergrooming

Zhang, Xinyi; Frohlich, Jake; Wisniewski, Pamela J; Cho, Jin-Hee; Huang, Lifu; Lee, Sang Won (May 2024, The Methods for Family-Centered Design Workshop at the CHI Conference on Human Factors in Computing Systems (CHI ’24))

Cybergrooming emerges as a growing threat to adolescent safety and mental health. One way to combat cybergrooming is to leverage predictive artificial intelligence (AI) to detect predatory behaviors in social media. However, these methods can encounter challenges like false positives and negative implications such as privacy concerns. Another complementary strategy involves using generative artificial intelligence to empower adolescents by educating them about predatory behaviors. To this end, we envision developing state-of-the-art conversational agents to simulate the conversations between adolescents and predators for educational purposes. Yet, one key challenge is the lack of a dataset to train such conversational agents. In this position paper, we present our motivation for empowering adolescents to cope with cybergrooming. We propose to develop large-scale, authentic datasets through an online survey targeting adolescents and parents. We discuss some initial background behind our motivation and proposed design of the survey, such as situating the participants in artificial cybergrooming scenarios, then allowing participants to respond to the survey to obtain their authentic responses. We also present several open questions related to our proposed approach and hope to discuss them with the workshop attendees.
more » « less
Full Text Available
The Art of Prompting: Event Detection based on Type Specific Prompts

https://doi.org/10.18653/v1/2023.acl-short.111

Wang, Sijia; Yu, Mo; Huang, Lifu (August 2023, Association for Computational Linguistics)

We compare various forms of prompts to represent event types and develop a unified framework to incorporate the event type specific prompts for supervised, few-shot, and zero-shot event detection. The experimental results demonstrate that a well-defined and comprehensive event type prompt can significantly improve event detection performance, especially when the annotated data is scarce (few-shot event detection) or not available (zero-shot event detection). By leveraging the semantics of event types, our unified framework shows up to 22.2% F-score gain over the previous state-of-the-art baselines.
more » « less
Full Text Available
Learning from a Friend: Improving Event Extraction via Self-Training with Feedback from Abstract Meaning Representation

https://doi.org/10.18653/v1/2023.findings-acl.662

Xu, Zhiyang; Lee, Jay Yoon; Huang, Lifu (August 2023, Association for Computational Linguistics)

Data scarcity has been the main factor that hinders the progress of event extraction. To overcome this issue, we propose a Self-Training with Feedback (STF) framework that leverages the large-scale unlabeled data and acquires feedback for each new event prediction from the unlabeled data by comparing it to the Abstract Meaning Representation (AMR) graph of the same sentence. Specifically, STF consists of (1) a base event extraction model trained on existing event annotations and then applied to large-scale unlabeled corpora to predict new event mentions as pseudo training samples, and (2) a novel scoring model that takes in each new predicted event trigger, an argument, its argument role, as well as their paths in the AMR graph to estimate a compatibility score indicating the correctness of the pseudo label. The compatibility scores further act as feedback to encourage or discourage the model learning on the pseudo labels during self-training. Experimental results on three benchmark datasets, including ACE05-E, ACE05-E+, and ERE, demonstrate the effectiveness of the STF framework on event extraction, especially event argument extraction, with significant performance gain over the base event extraction models and strong baselines. Our experimental analysis further shows that STF is a generic framework as it can be applied to improve most, if not all, event extraction models by leveraging large-scale unlabeled data, even when high-quality AMR graph annotations are not available.
more » « less
Full Text Available

« Prev Next »

Search for: All records